Research on Phoneme Sequences for Language Identification and Concurrent Voice Transmission
نویسنده
چکیده
Language Identification is process of identifying the language being spoken from a sample of speech by an unknown speaker. Most of the previous work in this field is based on the fact that phoneme sequences have different occurrence probabilities in different languages, and all the systems designed till now have tried to exploit this fact. Language identification process in turn consists of two sub-systems. First system converts speech into some intermediate form called as phoneme sequences, which are used to model the language by doing their probabilistic analysis in the second sub-system. In this project both of the sub-systems are targeted. First some algorithms are discussed for designing language models. Then an attempt is made to design an algorithm for extracting phoneme sequences in form of more abstract classes derived by statistical tools like Gaussian Mixture Models (GMM) and Hidden Markov Model (HMM).
منابع مشابه
Language Identification and Locale Based Web Browser: New Approach
―Voice recognition has come a long way in the last few years,‖ as said plain and simple by Judge Thomas C. Smith, author of Dictating to your Computer (for judges and lawyers) (1). What was once a sceptical and highly inaccurate, almost useless device, voice recognition is now an incredibly helpful and advanced tool. Voice recognition is defined by www.techtarget.com as simply, ―the ability of ...
متن کاملUsing Context-based Statistical Models to Promote the Quality of Voice Conversion Systems
This article aims to examine methods of optimizing GMM-based voice conversion systems performance in which GMM method is introduced as the basic method for improvement of voice conversion systems performance. In the current methods, due to using a single conversion function to convert all speech units and subsequent spectral smoothing arising from statistical averaging, we will observe quality ...
متن کاملImproving Phoneme Sequence Recognition using Phoneme Duration Information in DNN-HSMM
Improving phoneme recognition has attracted the attention of many researchers due to its applications in various fields of speech processing. Recent research achievements show that using deep neural network (DNN) in speech recognition systems significantly improves the performance of these systems. There are two phases in DNN-based phoneme recognition systems including training and testing. Mos...
متن کاملAutomatic Speaker and Language Identification
This thesis deals with the problem of automatic language identification (LID) and automatic speaker identification (SID) given the speech signal as input. Both researches have received renewed interest due to heightened homeland security awareness, e.g. in the use of speaker’s voice print for biometric identification, language identification for the classification of speech archives and call-sc...
متن کاملPump-priming PASCAL proposal: Large Margin Algorithms and Kernel Methods for Speech Applications
Research on large margin algorithms in conjunctions with kernels methods has been both exciting and successful. While there have been quite a few preliminary successes in applying kernel methods for speech applications, most the research efforts have focused on non-temporal problems such as text classification and optical character recognition (OCR). We propose to design, analyze, and implement...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2014